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Nevertheless, menus, advertisements or pages that cover multiple topics affect negatively 
the advantages of an implicit feedback technique that ... 

Keywords: implicit feedback, information needs, user model, user profile 

18 Content consum ption: PaqeTailor: re usable end-user customization for the mo bile jg 
&\ web 



Nilton Bila, Troy Ronda, Iqbal Mohomed, Khai N. Truong, Eyal de Lara 

June 2007 Proceedings of the 5th international conference on Mobile systems, 

applications and services MobiSys '07 
Publisher: ACM Press 

Full text available: *g) pdf( 1.27 M B) Additional Information: fulLcitstion, abstract, references, indexj.erms 

Most pages on the Web are designed for the desktop environment and render poorly on 
the small screens available on handheld devices. We introduce Reusable End-User 
Customization (REUC), a technique that lets end users adapt the layout of Web pages by 
removing, resizing and moving page elements, REUC records the user's customizations 
and automatically reapplies them on subsequent visits to the same page or to other, 
similar pages, on the same Web site. We present PageTailor, a REUC prototype b ... 

Keywords: customization, end-user, mobile web, small screen 



19 Data integration and sharin g ll:. Extractin g structured data from Web pages 
Arvind Arasu, Hector Garcia-Molina, Stanford University 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data SIGMOD '03 

Publisher: ACM Press 

Full text available: ffl pdf(587 99 KB) Additional Information: fulLcitaUon, abstract, references, citings, index 
^ terms 

Many web sites contain large sets of pages generated using a common template or layout. 
For example, Amazon lays out the author, title, comments, etc. in the same way in all its 
book pages. The values used to generate the pages (e.g., the author, title,..,) typically 
come from a database. In this paper, we study the problem of automatically extracting 
the database values from such template-generated web pages without any learning 
examples or other similar human input. We formally define a templa ... 
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<g> design 

^ September 2001 interactions, Volume 8 issue 5 
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www.articlebot.com Free Laptop, Free Year of AB Viral Contest Makes mP9^ red Link 

Dynamic web pag e - Wiki pedia, the fre e enc yclopedia 

The result of either technique is described as a dynamic web page, and both may be ... 
The dynamic page generation was made possible by the Common Gateway ... 
en.wikipedia.org/wiki/Dynamic_web_page - 26k - Cached - Similar pages 

Sw Tech.com - Dynamic Web Pa ge Generation 

A summary of the various technologies for doing Server-side dynamic page generation. 
Provides a cautionary note on how making the wrong technology decisions ... 
www.swtech.com/webdev/dynpage/ - 15k - Cached - Similar pages 

Buildin g D ynamic Web Sites with Topic Maps and XSLT 

Every topic link is rendered as a separate web page (WML card) ... Supply metadata for 
Natural Language Generation (System Associations) ... 
www.cogx.com/?si=um:cogx:resource:tmwsites - 24k - Cached - Similar pages 

Performan ce An alysis of Dyn amic Web Page Gener ation Technolo gies ... 

The Web has experienced phenomenal growth over the past few years, placing heavy load 
on Web servers. Today s Web servers also process an increasing number ... 
citeseer.ist.psu.edu/119628. html - 23k - Cached - Sjmilar_pages 

(WO/2001/057721 ) DYNAMIC WE B PAGE GENERATION 

Dynamic web page generation is optimized by reducing the processing overhead required 
to parse the web page HTML code for tokens and insert dynamic content. ... 
www. wipo.org/pctdb/en/wo.jsp?wo=2001 057721 - 14k - Cached - Similar p ages 

System for mana ging d ynamic web page g eneration requests by ... 

The present invention teaches a method and apparatus for creating and managing custom 
Web sites. Specifically, one embodiment of the present invention ... 
www.freepatentsonli.ne.com/5894554.html - 47k - Cached - Similar pages 

Offline dynamic web page g eneration - Patent 20030135819 

A method, computer program product, electronic document product, and data processing 
system for rendering web pages containing dynamic data is disclosed. 
www.freepatentsonline.com/20030135819.html - 51k - Cached - Similar pages 
[ More results from www.freepatentsonline.com ] 

ASP VB an d .NET Components for PDF HTML Im age and U pload - WebSu pergoo 

Let clients upload images onto your web site. Store images as files or in databases. Create 
dynamic PDF content. Make your web site extra sticky. ... 
www.websupergoo.com/ - 8k - Cached - Similar pages 

The Next Generation T.McrospfLQffice Online 

Microsoft Expression Web: Take advantage of the best of dynamic Web site design, ... 
Web for the professional Web designer. Top of Page Top of Page ... 

www.microsoft.com/frontpage/ - 37k - Cached - Simil ar pages 

System for managing dynamic web page generation req uests by ... 

System for managing dynamic web page generation requests by intercepting request at 
web server and routing to page server thereby releasing web server to ... 
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Dy namic web pag e cache - Patent 7096418 

A method for caching a dynamic web page, the method comprising: .... Alternatively, if 
cached dynamic pages were refreshed with sufficient frequency to ... 

www.freepatentsonline.com/7096418.html - 100k - Cached - Similar pag es 

Software Download: Dynamic Web Pa ge 

categorized by country. Software Listing: Dynamic Web Page .... and the checking 
frequency can be anywhere between once a minute and once a day. ... 
www.sharewareconnection.com/titles/dynamic-web-page7.htm - 91k - 

Cached - Similar pages 

Softwa re Download: D y namic W eb Page 

categorized by country. Software Listing: Dynamic Web Page LocalSearch Deluxe lists 

every search result according to frequency of appearance. ... 
www.sharewareconnection.com/tities/dynamic-web-page17.htm - 93k - 
Cached - Similar pages 

Web Page Desig n Downloa ds - Dynamic Web Page Eff ects allow a ... 
Dynamic Web Page Effects contains several products which allow a variety of .... cards 
printing and RFID (Radio Frequency and Identification) encoding. ... 
pcwin.com/popular/Web_Page_Design-8.htm - 69k - Cached - Similar pages 

Dyn amic Web Pa g e Effects Software - Sound Effect Maker. webcamAMP ... 
EzDNS publishes and/or emails your PC's dynamic IP address (both local and outside 
address) and connection information to a remote web page or text file ... 
pcwin.com/software/Dynamic_Web_Page_Effects/index-6.htm - 70k - 
Cached - Similar pages 

Pe rformance of Dynamic Web Pa g e Generation for Database-driven Web ... 
generate such dynamic web page for each user request, requires heavy dynamic script 

execution In addition to the access frequency (or request rate), ... 

doi.ieeecomputersociety.org/10.1109/NWESP.2Q06.24 - Similar p ages 

The Database B ehind Your Web P age - Terminolog y: 2003 Marketin g ... 
Dynamic Page: A dynamic Web page contains content that a user can interact with, such 
as information that is tied to a database. The user can request that ... 
instructor.mstc.edu/instructor/dcolby/images/WBEA/WBEA_SeminarWeb/Terminology.htm - 
1 2k - Cached - Smiiar_Bages 
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Active Software: dynamic web page effects draw automate screen calendar setup ... 
reservation frequency, hours of high restaurant load, busiest tables, ... 
www.filesland.com/download/active-48.html - 20k - Cached - Similar pages 

A Variety of effects and controls 

Dynamic Web Page Effects contains several products which allow a variety of effects ... 
feedback, frequency, delay, waveform, phase, gain, attack, release, ... 
www.soft-articles.com/programs/sirius-computer-consultants-limited/dynamic-web-page- 
effects.html - 15k - Cached - Similar pages 

Web page o ptimization systems invention 

Also, it provides such a method wherein such second computer processor means for 
selecting at least some dynamic web-page content comprises: sixth computer ... 
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1 Scalab le s ystems for dyna mic content: Gl obetp: template-based data base replication 
^ for scalable web applications 

^ Tobias Groothuyse, Swaminathan Sivasubramanian, Guillaume Pierre 

May 2007 Proceedings of the 16th international conference on World Wide Web 

WWW '07 
Publisher: ACM Press 

Full text available: ^ pdf( 238. 13 KB) Additional Information: full citation, abstract, refe rences, index Jerms 

Generic database replication algorithms do not scale linearly in throughput as all update, 
deletion and insertion (UDI) queries must be applied to every database replica. The 
throughput is therefore limited to the point where the number of UDI queries alone is 
sufficient to overload one server. In such scenarios, partial replication of a database can 
help, as UDI queries are executed only by a subset of all servers. In this paper we 
propose GlobeTP, a system that employs partial replication t ... 

Keywords: database replication, partial replication, scalability, web applications 



2 Applications: D ynamic coordination of information mana g ement services for 
<H> processin g d ynamic web content 
^ In-Young Ko, Ke-Thia Yao, Robert Neches 

May 2002 Proceedings of the 11th international conference on World Wide Web 

WWW '02 
Publisher: ACM Press 

Additional Information: fuN citation, abstract, references, citings, index 
terms 



Full text available: &pdf(1,15_MB) 



Dynamic Web content provides us with time-sensitive and continuously changing data. To 
glean up-to-date information, users need to regularly browse, collect and analyze this 
Web content. Without proper tool support this information management task is tedious, 
time-consuming and error prone, especially when the quantity of the dynamic Web 
content is large, when many information management services are needed to analyze it, 
and when underlying services/network are not completely reliable. This pap ... 

Keywords: dynamic service coordination, dynamic web content, scalable component- 
based software systems, semantic interoperability, web information management systems 



3 Posters: Keyword-based fra g ement detection for dynamic web content deliver y Q 
Daniel Brodie, Amrish Gupta, Weisong Shi 

May 2004 Proceedings of the 13th international World Wide Web conference on 
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Alternate track papers & posters WWW Alt. '04 
Publisher: ACM Press # 

Full text available: t jg[ pdf(48.04 KB ) Additional Information: full citation , abstract, references , index terms 

Fragment-based caching has been proposed as a promising technique for dynamic Web 
content delivery and caching. Most of these approaches either assume the fragment- 
based content is served by Web server automatically, or look at server-side caching only. 
There is no method of extracting fragments from an existing dynamic Web content, which 
is of great importance to thesuccess of fragment-based caching. Also, current 
technologies for supporting dynamic fragments do not allow to take into account c ... 

Keywords: dynamic web content delivery, fragment detection 



4 Web and e -business application: Dynamically g eneratin g web ap plication fra gments 
<||> from pag e templates 
Uwe Zdun 

March 2002 Proceedings of the 2002 ACM symposium on Applied computing SAC '02 
Publisher: ACM Press 

Full text available- figl pdf (900.91 KB) Additlonal Information: full citation , abstract , references , citings, index 
' la- ' • terms 

Web-based applications are typically required to be highly customizable and configurable. 
New application requirements have to be introduced rapidly, often without stopping the 
running application process. Moreover, in many cases the same business logic has to be 
presented to different channels and/or user interfaces. In this paper we present a 
dynamic page template architecture for decomposing configurable and representational 
fragments of the application from the business logic. Page templates ... 

Keywords: dynamic software architecture, Object-Oriented Scripting, web engineering 



5 Perfo rmance of service oriented systems: Speed-up SOAP_.processjhg_by data 
H> mapping template • 
^ Wei Jun, Hua Lei, Niu Chunlei 

May 2006 Proceedings of the 2006 international workshop on Service-oriented 
software engineering SOSE '06 

Publisher: ACM Press 

Full text available: *g) pdf(245.04 K B) Additional Information: full citation , abstra ct, reference s, i ndex term s 

Web Services is gaining popularity in distributed computing due to its loosely-coupled, 
high-interoperable and platform-independent characteristics. However, web services 
suffers performance penalty because XML based SOAP is used to specify wire message 
format, and SOAP processing largely affects the performance of web services. In this 
paper, we identify that data model mapping between XML data and Java data is the main 
impact factor on performance, and propose a new paradigm of data model mapp ... 

Keywords: SOAP, context free grammar, data mapping template, dynamic early binding, 
web services 




6 XML templ ates and cachi ng in W ASH . 
Peter Thiemann 

August 2003 Proceedings of the 2003 ACM SIGPLAN workshop on Haskell Haskell '03 
Publisher: ACM Press 

Full text available* f?| pdf(129 06 KB) Additional Information: full citation, abstract, references, citings, index 
• [A] = terms 

Caching of documents is an important concern on the Web. It is a major win in all 
situations where bandwidth is limited. Unfortunately, the increasing spread of dynamically 
generated documents seriously hampers traditional caching techniques in browsers and 



http://portal.acm.or^^ 11/8/2007 



Results (page 1): dynamic +web +template 



Page 3 of 7 



on proxy servers. WASH/CGI is a Haskell-based domain specific language for creating 
interactive Web applications. The Web pages generated by a WASH/CGI application are 
highly dynamic and cannot be cached with traditional means. We show how to i ... 

Keywords: annotated languages, caching, web programming 



Industrial sessions: middle-tier cachin g : Web cachin g for database a p plications with 
Oracle Web Cache 

Jesse Anton, Lawrence Jacobs, Xiang Liu, Jordan Parker, Zheng Zeng, Tie Zhong 
June 2002 Proceedings of the 2002 ACM SIGMOD i nternational conference on 

Management of data SIGMOD '02 
Publisher: ACM Press 

Full text available* , Pl Ddf(601 17 KB) Additional Information: full citation , abstract , references , citings, index 
" ^ : terms 

We discuss several important issues specific to Web caching for content dynamically ' 
generated from database applications. We present the techniques employed by Oracle 
Web Cache to address these issues. They include: content disambiguation based on 
information in addition to the URL, transparent session management, partial-page caching 
for personalization, and broad-scope invalidation with performance assurance heuristics. 

Keywords: caching, consistency, disambiguation, dynamic content, fragment, heuristics, 
invalidation, partial-page caching, performance, personalization, session, template 



8 Tools & techniques track: framew orks for b uildin g libraries: A web service. framework 
^ for embeddin g discovery se rvices i n distribut ed lib rary interfaces 
John Weatherley 

June 2005 Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries 
JCDL '05 

Publisher: ACM Press 

Full text available* fjp pdf d 64.33 KB) Addjtional Information: full citation , abstract , references , citings, index 

terms 

Significant barriers deter web page designers and developers from incorporating dynamic 
content from web services into their page designs. Web services typically require 
designers to learn service protocols and have access to and knowledge of dynamic 
application servers or CGI in order to incorporate dynamic content into their pages. This 
paper describes a framework for embedding discovery services in distributed interfaces 
that seeks to simplify this process and eliminate these barriers, making ... 

Keywords: discovery, retrieval, search, web services 



9 Versioning and fragm enta tion: Automatic detection of frag merits in dynam jcajly 

generated web pages 
^ Lakshmish Ramaswamy, Arun Iyengar, Ling Liu, Fred Douglis 

May 2004 Proceedings of the 13th international conference on World Wide Web 
WWW '04 

Publisher: ACM Press 

Full text available- f?l pdf( 268 12 KB) Additlona, Information: full citati on, abstract, references, citings, index 
' u±l terms 

Dividing web pages into fragments has been shown to provide significant benefits for both 
content generation and caching. In order for a web site to use fragment-based content 
generation, however, good methods are needed for dividing web pages into fragments. 
Manual fragmentation of web pages is expensive, error prone, and unscalable. This paper 
proposes a novel scheme to automatically detect and flag fragments that are cost- 
effective cache units in web sites serving dynamic content. We consider ... 
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Keywords: L-P fragments, dynamic content caching, fragment detection, fragment-based 
caching, shared fragments 



10 The <bi gwig> project 

Claus Brabrand, Anders M0iler, Michael I. Schwartzbach 

May 2002 ACM Transactions on Internet Technology (TOIT), Volume 2 issue 2 
Publisher: ACM Press 

Full text available: ^p_df(58a33 KB) Additional Information: fujl citation, abstract, references, citings, index 

terms 

We present the results of the <bigwig> project, which aims to design and implement a 
high-level domain-specific language for programming interactive Web services. 

A fundamental aspect of the development of the World Wide Web during the last decade 
is the gradual change from static to dynamic generation of Web pages. Generating Web 
pages dynamically in dialog with the client has the advantage of providing up-to-date and 
tailor-made information. The development of systems ... 

Keywords: Interactive Web services, program analysis 



11 A type system for dynamic Web documents | 
a Anders Sandholm, Michael I. Schwartzbach 

^ January 2000 Proceedings of the 27th ACM SIGPLAN-SIGACT symposium on 
Principles of programming languages POPL '00 

Publisher: ACM Press 

_ 1 1 , , ., Ul -s* , M ooym Additional Information: full citation , abstract , references , citings, index 

Full text available: pdf( 1.32 MB ) * 

^ terms 

Many interactive Web services use the CGI interface for communication with clients. They 
will dynamically create HTML documents that are presented to the client who then 
resumes the interaction by submitting data through incorporated form fields. This protocol 
is difficult to statically type-check if the dynamic documents are created by arbitrary script 
code using printf-like statements. Previous proposals have suggested using static 
document templates which trades flexibility for safety. W ... 

12 Measurin g and characterizin g end- to- e nd Internet service performance j 
Ludmila Cherkasova, Yun Fu, Wenting Tang, Amin Vahdat 

November 2003 ACM Transactions on Internet Technology (TOIT), Volume 3 issue 4 
Publisher: ACM Press 

., A •, u. ^ ^ R *r>\ Additional Information: full citation , abstract , references , citings, index 

Full text available: W pdf d.46 MB ) ; * 

terms 

Fundamental to the design of reliable, high-performance network services is an 
understanding of the performance characteristics of the service as perceived by the client 
population as a whole. Understanding and measuring such end-to-end service 
performance is a challenging task. Current techniques include periodic sampling of service 
characteristics from strategic locations in the network and instrumenting Web pages with 
code that reports client-perceived latency back to a performance server. Li ... 

Keywords: End-to-end service performance, QoS, network packet traces, passive ' 
monitoring, reconstruction of web page composition, web site performance 

13 Software design ^languages and systems: Template extractipn frQrrLC_andidate 
<#> tem plate set generation: a structure and content approach 

^ Hang Su, Qiaozhu Mei 

March 2005 Proceedings of the 43rd annual Southeast regional conference - Volume 
2 ACM-SE 43 
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Publisher: ACM Press 

Full text available:^ pdf(467.05 KB) Additional Information: fulidtation, abstract, refergnc.es, indexjejnis 

This paper introduces a new approach of webpage template extraction. Unlike traditional 
methods which concern only content information, this paper considers both structure and 
content similarity. It uses natural table structure as content units instead of text blocks or 
pagelets. This paper novelly and formally defines the templates and other concepts. It 
introduces a new concept, candidate template, which is an intermediate level of abstract 
table structure. A candidate template only cov ... 

Keywords: candidate template set, structure similarity, table informativeness, template 
extraction 



14 Short Papers: Desi g ning dynamic web pag es and persistence in the WYSIWYG 
interface 

David Wolber, Yingfeng Su, Yih Tsung Chiang 

January 2002 Proceedings of the 7th international conference on Intelligent user 
interfaces IUI 02 

Publisher: ACM Press 

Full text available- "fiCl pdf(132 75 KB) Addjt ' ona! Information: full citation, abstract, refere nces, citings, index 
l^j terms 

WebSheets is a programming in the WYSIWYG interface tool for building dynamic web 
pages that access and modify databases. Without programming, designers can specify not 
only the presentation of a page, but the dynamic content as well. This capability is 
facilitated through a novel application of Programming by Example (PBE), Query by 
Example (QBE), and spreadsheet formulas within the WYSIWYG HTML editor environment. 

Keywords: HTML, PBE, QBE, dynamic web pages, spreadsheets 



15 XML transactions: An ob j ect-oriented extension of XML for autonomous web 
d|> ap plications 

Hasan M. Jamil, Giovanni A. Modica 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management CIKM '02 

Publisher: ACM Press 

Full text available: *g] pdf (277.52 KB) Additional Information: full citation , abstract , references , index terms 

While the idea of extending XML to include object-oriented features has been gaining 
popularity in general, the potential of inheritance in document design has not been well 
recognized in contemporary research. In this paper we demonstrate that XML with 
dynamic inheritance aids better document designs and decreased management overheads 
and support increased autonomy. As an extended application, we point out that dynamic 
inheritance also helps effective automated web portal and ontology designs. W ... 

Keywords: XML, autonomous objects, document structuring, dynamic object hierarchy, 
inheritance, object-orientation, web 



16 Eng ineerin g desi g n: Automatic accessibility evaluation of dynamic web pages j 
^ generated throu g h XSLT 

^ Andre Pimenta Freire, Renata Pontin de Mattos Fortes 

May 2005 Proceedings of the 2005 International Cross-Disciplinary Workshop on 

Web Accessibility (W4A) W4A '05 
Publisher: ACM Press 

Full text available: l Q pdf(99.18 KB) Additional Information: full citation , abstract , references , index terms 
Much effort has been dedicated to develop software aids for authoring and evaluating Web 



http://portal.acm.org/results.cfa^ 



11/8/2007 



Results (page 1): dynamic +web +template 



Page 6 of 7 



pages using accessibility guidelines and standards. The evaluation of dynamic Web pages 
is a problem still unsolved in the field of automatic evaluation tools, since the current 
evaluators are only able to evaluate static Web pages. Stone and Dhiensa have addressed 
this problem/and proposed a method for evaluating the accessibility of dynamic Web 
pages using a generalized page which contains all possible ou ... 

Keywords: XML, user interface, web accessibility evaluation 



1 7 Rese arc h sessio ns: distributed systems: Proxy-based acceleration J?Ldy,namically 
<H> generated content on the world wide web: an approach and implementation 
" Anindya Datta, Kaushik Dutta, Helen Thomas, Debra VanderMeer, Suresha, Krithi 
Ramamritham 

June 2002 Proceedings of the 2002 ACM SIGMOD i nternational conference on 
Management of data SIGMOD '02 

Publisher: ACM Press 

Full text available* fiQ pdf(137J\/IB) Additional Information: full citation , abstract, references , citings, index 

terms 

As Internet traffic continues to grow and web sites become increasingly complex, 
performance and scalability are major issues for web sites. Web sites are increasingly 
relying on dynamic content generation applications to provide web site visitors with 
dynamic, interactive, and personalized experiences. However, dynamic content generation 
comes at a cost — each request requires computation as well as communication across 
multiple components. To address these issues, various dynamic content each ... 

Keywords: dynamic content, edge caching, proxy-based caching 



1 8 Man aging multime dia in documents: Applying caT's programmable browsing 
^ semantics tqj>£ec]fyj/v^ documents. tjiat jeflectj?^ and 

co mmun ity 

Richard Furuta, Jin-Cheon IMa 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 
DocEng '02 

Publisher: ACM Press 

Full text available: pdf(947.45 KB) Additional Information: full citation , abstract, references, citings, index 
• [a| - terms 

In this paper we discuss application of caT, which extends the Trellis Petri-net-based 
model of document/hypertext, towards specification of Web-browsable documents that 
respond to their reader's characteristics, browsing activities, use environment, and 
interactions with other readers. The Petri net basis provides both a graphical 
representation of the nodes and links in the. hypertext and also an automaton-based 
specification of the browsing behaviors encountered by readers examining the hypert ... 

Keywords: Petri-net-based hypertext, Trellis, caT, context-aware hypertext 



19 Education: D eveloping a uni versally accessible web portal for traditional and distanc e 
<g> learn ing versions of a com puter literacy c ourse: an Au burn University case stu dy 
^ Daniela Marghitu, Chris Harmon, Kai Chang 

March 2005 Proceedings of the 43rd annual Southeast regional conference - Volume 
1 ACM-SE 43 

Publisher: ACM Press 

Full text available: pdf(767.64 KB) Additional Information: full citation, ab_stract, references, indexjerms 

Information Technology (IT) offers a wide range of opportunities for education and career 
enhancement for those who have access to the technologies they employ. However, many 
people find themselves on the wrong side of the digital divide that separates those with 
access to new technologies and those without. Even if they have access to these 



http://portahacm.org/resultsxfm?coll=ACM&dI=ACM&CFID=568 



11/8/2007 



Results (page 1): dynamic +web +template Page 7 of 7 

technologies, some people with disabilities find themselves on the wrong side of a second 
digital divide that is caused by the inaccessible design of course ... 

Keywords: accessible web design and development, assistive technology, distance 
education 



20 Document editing for the web: Templates, microformats and structured editin g 




Francesc Campoy Flores, Vincent Quint, Irene Vatton 

October 2006 Proceedings of the 2006 ACM symposium on Document engineering 
DocEng '06 

Publisher: ACM Press 

Full text available: pdf(236.63 K8) Additional Information: full citation, abstract , references , index terms 

Microformats and semantic XHTML add semantics to web pages while taking advantage of 
the existing (X)HTML infrastructure. This approach enables new applications that can be 
deployed smoothly on the web. But there is currently no way to describe rigorously this 
type of markup and authors of web pages have very little help for creating and encoding 
semantic markup. A language that addresses these issues is presented in this paper. Its 
role is to specify semantically rich XML languages in terms of ... 

Keywords: document authoring, document models, document templates, microformats, 
semantic XHTML, structure editing, world wide web 
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1 Dyjiamic.Ba^ content in t he in ternet 

Pablo Rodriguez, Ernst W. Biersack 

August 2002 IEEE/ACM Transactions on Networking (TON), Volume 10 issue 4 
Publisher: IEEE Press 

Full text available' Wi pdf(350 99 KB) Additional Information: fuiJ_citatioQ, abstract, references, citings, index 

* terms 

Popular content is frequently replicated in multiple servers or caches in the Internet to 
offload origin servers and improve end-user experience. However, choosing the best 
server is a nontrivial task and a bad choice may provide poor end user experience. In 
contrast to retrieving a file from a single server, we propose a parallel-access scheme 
where end users access multiple servers at the same time, fetching different portions of 
that file from different servers and reassembling them locally. ... 

Keywords: HTTP, content distribution, internet, mirroring, parallel access, peer-to-peer, 
replication, web 



2 Ap plications: Dynamic coordination of information mana g ement services for 

processin g d ynamic web content 
^ In-Young Ko, Ke-Thia Yao, Robert Neches 

May 2002 Proceedings of the 11th international conference on World Wide Web 
WWW 02 

Publisher: ACM Press 

Additional Information: full citation, abstract, re feren ces, citings, i ndex 
terms 



Full text available: «g.pdf(iJ5_MB) 



Dynamic Web content provides us with time-sensitive and continuously changing data. To 
glean up-to-date information, users need to regularly browse, collect and analyze this 
Web content. Without proper tool support this information management task is tedious, 
time-consuming and error prone, especially when the quantity of the dynamic Web 
content is large, when many information management services are needed to analyze it, 
and when underlying services/network are not completely reliable. This pap ... 

Keywords: dynamic service coordination, dynamic web content, scalable component- 
based software systems, semantic interoperability, web information management systems 



3 Mobility & wireless access: Dynamic s ervi ce reconfi gu ration for wireless web access 
Siu-Nam Chuang, Alvin T.S. Chan, Jiannong Cao, Ronnie Cheung 
May 2003 Proceedings of the 12th international conference on World Wide Web 
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WWW '03 

Publisher: ACM Press 

Full text available: "HI pdf(620.78 KB) Additlonal Information: full citation , abstract, references , citings, index 

terms 

This paper describes a dynamic service reconfiguration model where the proxy is 
composed of a chain of service objects called mobilets (pronounced as mo-be-lets), which 
can be deployed onto the network actively. This model offers flexibility because the chain 
of mobilets can be dynamically reconfigured to adapt to the vigorous changes in the 
characteristics of the wireless environment, without interrupting the service provision for 
other mobile nodes. Furthermore, mobilets can also be migrated t ... 

Keywords: active services, dynamic service reconfiguration, wireless environment 
adaptation, wireless web access 

4 General ap plications: general applicat ions : Applying parallel dyna mic-res olution §g| 
simulations to accelerate VLSI pow er estimation 

Dhananjai M. Rao, Philip A. Wilsey 

December 2006 Proceedings of the 38th conference on Winter simulation WSC 06 
Publisher: Winter Simulation Conference 

Full text available: *gjpdf (191.64 KB) Additional Information: full citation , abstract , references 

High resolution models of logic circuits need to be used in simulations to accurately track 
logic transitions or glitches, which contribute to the most dominant portion of VLSI power 
dissipated. Unfortunately, simulating large, high resolution models is a time consuming 
task. Although more abstract models that simulate faster can be used, they are less 
accurate as details of glitching activity are absent. This study proposes an alternatively 
approach that dynamically (i.e., during simulation) ch ... 

5 Dyn amic ro le alloc ation for small search eng ine clusters jggjj 
^gk Ndapandula Nakashole, Hussein Suleman, Calvin Pedzai 

October 2007 Proceedings of the 2007 annual research conference of the South 

African institute of computer scientists and information technologists 
on IT research in developing countries SAICSIT '07 

Publisher: ACM Press 

Full text available: "Q.pdf(290^58 KB) Additional Information: full citation, abstract, references, indexjerms 

Search engines facilitate efficient discovery of information in large information 
environments such as the Web. As the amount of information rapidly increases, search 
engines require greater computational resources. Similarly, as the user base increases 
search engines need to handle increasing numbers of user requests. Existing solutions to 
these scalability problems are. often designed for large computer clusters. This paper 
presents a flexible solution that is deployable also on small cluste ... 

Keywords: dynamic allocation, indexing, querying, small search engine cluster 

6 Web-based and Java-based simulation: Dynamic component substitution in web- 
based simulation 

Dhananjai Madhava Rao, Philip A. Wilsey 

December 2000 Proceedings of the 32hd conference on Winter simulation WSC '00 
Publisher: Society for Computer Simulation International 

Full text available: ^ pdf(90 .08 K B) Additional Information: full citation , abstract, references, citings 

Recent breakthroughs in communication and software engineering has resulted in 
significant growth of web-based computing. Web-based techniques have been employed 
for modeling, simulation, and analysis of systems. The models for simulation are usually 
developed using component based techniques. In a component based model, a system is 
represented as a set of interconnected components. A component is a well defined 
software module that is viewed as a "black box" i.e., only its interface is o ... 
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DDD pa pers: Dom ain driven web developm ent with WebJi nn 
Sergei Kojarski, David H. Lorenz 

October 2003 Companion of the 18th annual ACM SIGPLAN conference on Object- 
oriented programming, systems, languages, and applications OOPSLA 
'03 

Publisher: ACM Press 

Full text available- "Pi pdf (266.32 KB) Additional Information: full citation , abstract , references , citings, index 

terms 

Web application development cuts across the HTTP protocol, the client-side presentation 
language (HTML, XML), the server-side technology (Servlets, JSP, ASP, PHP), and the 
underlying resource (files, database, information system). Consequently, web 
development concerns including functionality, presentation, control, and structure cross- 
cut, leading to tangled and scattered code that is hard to develop, maintain, and reuse. In 
this paper we analyze the cause, consequence, and remedy for this cms ... 

Keywords: JSP, adaptability, aspect-oriented programming (AOP), crosscutting concerns, 
dynamic pages, generative programming, inter-crosscutting, intra-crosscutting, model- 
view-controller (MVC), reusability, scattering, struts, tangling, web application, web 
development, web programming 



8 Web s yst em-or iented performance: Capacity planning tools for web and grid 

|k environments 

Sugato Bagchi, Eugene Hung, Arun Iyengar, Norbert Vogl, Noshir Wadia 

October 2006 Proceedings of the 1st international conference on Performance 

evaluation methodolgies and tools valuetools '06 
Publisher: ACM Press 

Full text available: E jg!| pdf(453.91 KB) Additional Information: full citation , abstract , references , index terms 

A key aspect in managing resources for customer sites is to predict and assess the load 
associated with a site in order to figure out how best to allocate resources for the site 
over time and to efficiently schedule tasks. The cost associated with the site and return on 
investment are also key parameters. This paper describes work we have done in 
developing tools for answering these critical questions. The tools use both analytical 
models and discrete event simulations to predict performance and ... 

Keywords: capacity planning, grid computing, performance modeling, web performance 
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Masaru Kitsuregawa, Takahiko Shintani, Iko Pramudiono 

January 2001 Australian Computer Science Communications , Proceedings of the 

workshop on Information technology for virtual enterprises ITVE •Ol , 
Proceedings of the workshop on Information technology for virtual 
enterprises ITVE '01, volume 23 issue 6 
Publisher: IEEE Computer Society, IEEE Computer Society Press 
Full text available:^ pdf(674.03 KB) 
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Web mining can be classified into two categories, Web access log mining and Web 
structure mining. We performed association rule mining and sequence pattern mining 
against the access log which was accumulated at NTT Software Mobile Info Search portal 
site. Detail web log mining process and the rules we derived are reported in this paper. 
The parallel association rule mining is explored on large scale PC cluster system. 
Parallelism is key to improve the performance. We achieved substantial speed u ... 

10 The state of the art in locally distributed Web-server systems 
Valeria Cardellini, Emiliano Casalicchio,' Michele Colajanni, Philip S. Yu 
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June 2002 ACM Computing Surveys (CSUR), Volume 34 issue 2 
Publisher: ACM Press 

Full text available* t p| pdf (1 41 MB) Additional Information: full citation , abstract , references , citings, in dex 

terms 

The overall increase in traffic on the World Wide Web is augmenting user-perceived 
response times from popular Web sites, especially in conjunction with special events. 
System platforms that do not replicate information content cannot provide the needed 
scalability to handle large traffic volumes and to match rapid and dramatic changes in the 
number of clients. The need to improve the performance of Web-based services has 
produced a variety of novel content delivery architectures. This article w ... 

Keywords: Client/server, World Wide Web, cluster-based architectures, dispatching 
algorithms, distributed systems, load balancing, routing mechanisms 

11 The Web as a parallel cor pus 
Philip Resnik, Noah A. Smith 

September 2003 Computational Linguistics, volume 29 issue 3 
Publisher: MIT Press 

Full text available: « pdf(539.83 KB) Additional Information: full citation , abstract, references, citings, index 

terms 

Parallel corpora have become an essential resource for work in multilingual natural 
language processing. In this article, we report on our work using the STRAND system for 
mining parallel text on the World Wide Web, first reviewing the original algorithm and 
results and then presenting a set of significant enhancements. These enhancements 
include the use of supervised learning based on structural features of documents to 
improve classification performance, a new content-based measure of translati ... 

1 2 Web technologies and applications ( WTA) :_M^JIlPl]ica) evaluation ^pf clienl-side Q 
<§> serv er selection policies for accessing ieplicated„we_b services 

Nabor C. Mendonga, Jose Airton F. Silva 

March 2005 Proceedings of the 2005 ACM symposium on Applied computing SAC '05 
Publisher: ACM Press 

Full text available: 'g] pdf(231.14 KB) Additional Information: full citation , abstract , references, index terms 

Replicating web services at geographically distributed servers can offer client applications 
with a number of benefits, including higher service availability and improved response 
time. However, selecting tfte"best" server to invoke at the client side is not a trivial task, 
as this decision needs to account for (and is affected by) a number of factors, such as 
local connection capacity, external network conditions and servers workload. This paper 
presents the results of an experiment in which we ... 

Keywords: empirical evaluation, replicated web services, server selection 



13 Web-based simulation: Web-ba sed simulation 2: performance prediction of dynamic 
c omponent substitutions 
Dhananjai M. Rao, Philip A. Wilsey 

December 2002 Proceedings of the 34th conference on Winter simulation: exploring 

new frontiers WSC '02 
Publisher: Winter Simulation Conference 

Full text available: ^jidl(l_7_6JS2 KB) Additional Information: fuM citaiio_n, abstract, references 

The Web-based Environment for Systems Engineering (wese) is a web-based modeling 
and simulation environment in which the level of abstraction of a model can be configured 
<i > statically </i > (prior to simulation) or <i>dynamically</i> (during simulation) by 
substituting a <i>module</i> (set of components) with an equivalent component or vice 
versa through a process called Dynamic Component Substitution (DCS). DCS can 
considerably improve the overall efficiency of simulat ... 
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14 Technical pa pers: Revisitin g web server workload in variants in the context of j| 
scientific web sites 

Anne M. Faber, Minaxi Gupta, Camilo H. Viecco 

November 2006 Proceedings of the 2006 ACM/IEEE conference on Supercomputfng SC 
'06 

Publisher: ACM Press 

Full text available: S pdf(482.54 KB) AJJ !. llf . , H 

[l] html(2.02 KB) ' Add,tlonal Information: full citation , abstract , references , index terms 

The Web has evolved much from when Arlitt and Williamson proposed the ten Web 
workload invariants more than a decade ago. Many diverse communities now depend on 
the Web in their day-to-day lives. A current knowledge of the invariants for the Web is 
useful for performance enhancement and for synthetic Web workload generation. 
. Invariants can also serve as a useful tool for detecting anomaly and misuse, a new 
dimension of Web usage arising from the change in trust assumptions in the Internet in 
the... 

1 5 Web search ing: Specialisation dynamics in federate,^ 
Rinat Khoussainov, Nicholas Kushmerick 

November 2004 Proceedings of the 6th annual ACM international workshop on Web 
information and data management WIDM '04 

Publisher: ACM Press 

Full text available: t g pdf d 38.32 KB) Additional Information: full citation , abstract , references , index term s 

Organising large-scale Web information retrieval systems into hierarchies of topic-specific 
search resources can improve both the quality of results and the efficient use of 
computing resources. A promising way to build such systems involves federations of 
topic-specific search engines in decentralised search environments. Most of the previous 
research concentrated on various technical aspects of such environments (e.g. routing of 
search queries or merging of results from multiple sources). W ... 

Keywords: competition, federated web search, topic specialisation 
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16 Efficient al g orithms for Web services selection with end-to-end QoS constraints 
Tao Yu, Yue Zhang, Kwei-Jay Lin 

V May 2007 ACM Transactions on the Web (TWEB), Volume l issue l 
Publisher: ACM Press 

Full text available: ^pdf(832 ? 74 KB) Additional Information: full citation, abstract, references, jndexjerms 

Service-Oriented Architecture (SOA) provides a flexible. framework for service 
composition. Using standard-based protocols (such as SOAP and WSDL), composite 
services can be constructed by integrating atomic services developed independently. 
Algorithms are needed to select service components with various QoS levels according to 
some application-dependent performance requirements. We design a broker-based 
architecture to facilitate the selection of QoS-based services. The objective of service s ... 

Keywords: End-to-end QoS, Web services, service composition, service oriented 
architecture (SOA), service selection 
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18 Cross-lan guag e informati on retrieval: Translating unknown queries with web corpora Q 
<§> for cross-language information retrieval 

^ Pu-Jen Cheng, Jei-Wen Teng, Ruei-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, Lee-Feng 
Chien 

July 2004 Proceedings of the 27th annual international ACM SIGIR conference on 

Research and development in information retrieval SIGIR '04 
Publisher: ACM Press 

Full text available* pdf(387 08 KB) Addit ' ona, Information: full citation, abstract , referen ces, citings, index 
• [aj = terms 

It is crucial for cross-language information retrieval (CLIR) systems to deal with the 
translation of unknown queries due to that real queries might be short. The purpose of 
this paper is to investigate the feasibility of exploiting the Web as the corpus source to 
translate unknown queries for CLIR. We propose an online translation approach to 
determine effective translations for unknown query terms via mining of bilingual search- 
result pages obtained from Web search engines. This approach can a ... 

Keywords: cross-language information retrieval, cross-language web search, query 
translation 

19 Creatin g m ultilin g ual transla tion lexicons with regi onal varia tions using web co rpora H 
Pu-Jen Cheng, Yi-Cheng Pan, Wen-Hsiang Lu, Lee-Feng Chien 

July 2004 Proceedings of the 42nd Annual Meeting on Association for Computational 
Linguistics ACL '04 

Publisher: Association for Computational Linguistics 

Full text available: pdf( 303.85 KB) Additional Information: fuLcitation, abstract, r eference s 

The purpose of this paper is to automatically create multilingual translation lexicons with 
regional variations. We propose a transitive translation approach to determine translation 
variations across languages that have insufficient corpora for translation, via the mining of 
bilingual search-result pages and clues of geographic information obtained from Web 
search engines. The experimental results have shown the feasibility of the proposed 
approach in efficiently generating translation equivalen ... 

20 A performance comparison of dynamic Web technolog ies . j| 
Jjby Lance Titchkosky, Martin Arlitt, Carey Williamson 

N/ December 2003 ACM SIGMETRICS Performance Evaluation Review, Volume 31 issue 3 
Publisher: ACM Press 

Full text available: *g| pdf(1.Q2 MB) Additional Information: full citation , abstract , references 

Today, many Web sites dynamically generate responses "on the fly" when user requests 
are received. In this paper, we experimentally evaluate the impact of three different 
dynamic content technologies (Perl, PHP, and Java) on Web server performance. We 
quantify achievable performance first for static content serving, and then for dynamic 
content generation, considering cases both with and without database access. The results 
show that the overheads of dynamic content generation reduce the peak re ... 

Keywords: Dynamic Content Generation, Performance Evaluation, Web Performance, 
Web Server Benchmarking 
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1 Englr3Mripg_designi Autom_atic accessibility evaluation of dynamic web pages j 
^ g enerated through XSLT 

^ Andre Pimenta Freire, Renata Pontin de Mattos Fortes 

May 2005 Proceedings of the 2005 International Cross-Disciplinary Workshop on 

Web Accessibility (W4A) W4A '05 
Publisher: ACM Press 

Full text available: t g) pdf(99. 18 KB) Additional Information: full citation , abstract, reference s, in dex terms 

Much effort has been dedicated to develop software aids for authoring and evaluating Web 
pages using accessibility guidelines and standards. The evaluation of dynamic Web pages 
is a problem still unsolved in the field of automatic evaluation tools, since the current 
evaluators are only able to evaluate static Web pages. Stone and Dhiensa have addressed 
this problem, and proposed a method for evaluating the accessibility of dynamic Web 
pages using a generalized page which contains all possible ou ... 

Keywords: XML, user interface, web accessibility evaluation 



2 A performance comparison of dynamic Web technolo gies 
Lance Titchkosky, Martin Arlitt, Carey Williamson 

December 2003 ACM SIGMETRICS Performance Evaluation Review, Volume 31 issue 3 
Publisher: ACM Press 

Full text available: *g| pdf(1.02 MB ) Additional Information: full citati on, abstract, references 

Today, many Web sites dynamically generate responses "on the fly" when user requests 
are received. In this paper, we experimentally evaluate the impact of three different 
dynamic content technologies (Perl, PHP, and Java) on Web server performance. We 
quantify achievable performance first for static content serving, and then for dynamic 
content generation, considering cases both with and without database access. The results 
show that the overheads of dynamic content generation reduce the peak re ... 

Keywords: Dynamic Content Generation, Performance Evaluation, Web Performance, 
Web Server Benchmarking 
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|^ g enerated content on the world wide web: an approach and implementation . 
" Anindya Datta, Kaushik Dutta, Helen Thomas, Debra VanderMeer, Suresha, Krithi 
Ramamritham 

June 2002 Proceedings of the 2002 ACM SIGMOD i nternational conference on 
Management of data SIGMOD 02 

Publisher: ACM Press 
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As Internet traffic continues to grow and web sites become increasingly complex, 
performance and scalability are major issues for web sites. Web sites are increasingly 
relying on dynamic content generation applications to provide web site visitors with 
dynamic, interactive, and personalized experiences. However, dynamic content generation 
comes at a cost — each request requires computation as well as communication across 
multiple components. To address these issues, various dynamic content each ... 

Keywords: dynamic content, edge caching, proxy-based caching 



4 Full Technical Pa pers: D ynamic we b pag e authorin g b y exam ple using ontology- 
^ based domain knowled ge 
^ Jose A. Madas, Pablo Castells 

January 2003 Proceedings of the 8th international conference on Intelligent user 

interfaces IUI '03 
Publisher: ACM Press 

Full text available: ^| pdf( 469.78 KB ) Additional Information: full citation , abstract : references , index terms • 

Authoring dynamic web pages is an inherently difficult task. We present DESK, an 
interactive authoring tool that allows the customization of dynamic page generation 
procedures with no a-priori tool-specific skill requirements from authors. Our approach 
consists of combining Programming By Example (PBE) techniques with an ontology-based 
representation of knowledge displayed in web pages. DESK acts as a client-side 
complement of a dynamic web page generation system, PEGASUS, which generates HTML 
p... 

Keywords: knowledge-based UI design, model-based paradigm, ontology, programming 
by example 



5 Proxy-based acce leration of dynamically generated content on the world wide web: 
<§> An approach and implementation 

^ Anindya Datta, Kaushik Dutta, Helen Thomas, Debra Vandermeer, Krithi Ramamritham 
June 2004 ACM Transactions on Database Systems (TODS), Volume 29 issue 2 
Publisher: ACM Press 

Full text available: « pdf(927.23 KB ) Additional Information: full citation , ap pendices and su ppjements, ■ 

abstract, references , index terms 

As Internet traffic continues to grow and websites become increasingly complex, 
performance and scalability are major issues for websites. Websites are increasingly 
relying on dynamic content generation applications to provide website visitors with 
dynamic, interactive, and personalized experiences. However, dynamic content generation 
comes at a cost— each request requires computation as well as communication across 
multiple components.To address these issues, various dynamic content caching ap ... 

Keywords: Edge caching, caching dynamically generated content, fragment caching, 
implementation, proxy caching, world wide web 



6 Applications: Dynamic coordination of information m_^nagemenj_sei^ic for 
<H> proces sing dynamic web content 
^ In-Young Ko, Ke-Thia Yao, Robert Neches 

May 2002 Proceedings of the 11th international conference on World Wide Web 

WWW '02 
Publisher: ACM Press 

Full text available 1 ^ pdf(1 15 MB) Additional Information: full citation , abstract, references , citings, index 
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Dynamic Web content provides us with time-sensitive and continuously changing data. To 
glean up-to-date information, users need to regularly browse, collect and analyze this 
Web content. Without proper tool support this information management task is tedious, 
time-consuming and error prone, especially when the quantity of the dynamic. Web 
content is large, when many information management services are needed to analyze it, 
and when underlying services/network are not completely reliable, This pap ... 

Keywords: dynamic service coordination, dynamic web content, scalable component- 
based software systems, semantic interoperability, web information management systems 



Generatin g d ynamic content at database-backed web servers: c g i-bin vs. mod perl | 
Alexandros Labrinidis, Nick Roussopoulos 
March 2000 ACM SIGMOD Record, Volume 29 issue l 

Publisher: ACM Press 

Full text available: *g) pdf(508.64 KB) Additional Information: full citation, abstract, citings, index term s 

Web servers are increasingly being used to deliver dynamic content rather than static 
HTML pages. In order to generate web pages dynamically, servers need to execute a 
script, which typically connects to a DBMS. Although CGI was the first approach at server 
side scripting, it has significant performance shortcomings. Currently, there are many 
alternative server side scripting architectures which offer better performance than CGI. In . 
this paper, we report our experiences using mod_pe ... 

8 Versionin q and frag mentation: Automatic detection of fra g ments in dynam ically | 
<g> g enerated web pages 

^ Lakshmish Ramaswamy, Arun Iyengar, Ling Liu, Fred Douglis 

May 2004 Proceedings of the 13th international conference on World Wide Web 

WWW '04 
Publisher: ACM Press 

r- ■■ 4 t u. en A<tn*a An Additional Information: full citation , abstract , references , citings, index 
Full text available: *g] pdf (268.12 KB) 

Dividing web pages into fragments has been shown to provide significant benefits for both 
content generation and caching. In order for a web site to use fragment-based content 
generation, however, good methods are needed for dividing web pages into fragments. 
Manual fragmentation of web pages is expensive, error prone, and unscalable. This paper 
proposes a novel scheme to automatically detect and flag fragments that are cost- 
effective cache units in web sites serving dynamic content. We consider ... 

Keywords: L-P fragments, dynamic content caching, fragment detection, fragment-based 
caching, shared fragments 



Efficiently servin g d ynamic data at highly accessed web sites 

James R. Challenger, Paul Dantzig, Arun Iyengar, Mark S. Squillante, Li Zhang 

April 2004 IEEE/ ACM Transactions on Networking (TON), volume 12 issue 2 

Publisher: IEEE Press 

ma 1 , u . £5* a*,a™ ac Additional Information: full citation, abstract , references, citin gs, index 

Full text available: rq pdf(499.05 KB ) a 

« terms 

We present architectures and algorithms for efficiently serving dynamic data at highly 
accessed Web sites together with the results of an analysis motivating our design and 
quantifying its performance benefits. This includes algorithms for keeping cached data 
consistent so that dynamic pages can be cached at the Web server and dynamic content 
can be served at the performance level of static content. We show that our system design 
is able to achieve cache hit ratios close to 100% for cached data ... 

Keywords: caching, dynamic content, performance analysis, prefetching, stochastic 
models, web sites 
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10 D ynamic services and analysis: E ngineering and hostin g ada ptive freshness- ! 
<|k sensitive web applications on data ce nters 

^ Wen-Syan Li, Oliver Po, Wang-Pin Hsiung, K. Selguk Candan, Divyakant Agrawal 

May 2003 Proceedings of the 12th international conference on World Wide Web 

WWW '03 
Publisher: ACM Press 

Full text available- f£| pdf(10 31 MB) Additional Information: fuM_ citation, abstract, references, citings, index 
• (A] ' terms 

Wide-area database replication technologies and the availability of content delivery 
networks allow Web applications to be hosted and served from powerful data centers. This 
form of application support requires a complete Web application suite to be distributed 
along with the database replicas. A major advantage of this approach is that dynamic 
content is served from locations closer to users, leading into reduced network latency and 
fast response times. However, this is achieved at the expense ... 

Keywords: database-driven web applications, dynamic content, freshness, response 
time, net-work latency, web acceleration 



11 Multimodal, multidevice and beyond: Dynamic g eneration of web mi g ratory interfaces 
&y Renata Bandelloni, Giulio Mori, Fabio Paterno 

September 2005 Proceedings of the 7th international conference on Human computer 
interaction with mobile devices & services MobileHCI '05 

Publisher: ACM Press 

_ „ . . ,, .=» ,, H TO Mm Additional Information: full citation, abstract, references, citings, index 

Full text available: TO pdf(j .72 MB) — - 

^ terms 

In this paper, we present a solution for dynamic generation of Web user interfaces that 
can dynamically migrate among different platforms. The solution is based on a 
migration/proxy server able to automatically convert a desktop service into a service 
accessible from a different platform, such as a mobile one. This solution can support new 
environments where users can freely move about and change interaction device while still 
continuing task performance and accessing the application in a usable ... 

Keywords: automatic transformations, migratory interfaces, model-based design, 
ubiquitous environments 



12 Session 8: distributed systems: Transparent cachin g with strong consistency in j 
J*y d ynamic content web sites 

^ Cristiana Amza, Gokul Soundararajan, Emmanuel Cecchet 

June 2005 Proceedings of the 19th annual international conference on 

Supercomputing ICS '05 
Publisher: ACM Press 

Full text available: *Q pdf(37Q.07 KB) Additional Information: full citation , abstract , references 

We consider a cluster architecture in which dynamic content is generated by a database 
back-end and a collection of Web and application server front-ends. We study the effect of 
transparent query caching on the performance of such a cluster. Transparency requires 
that cached entries be invalidated as a result of writes. We start with a coarse-grain table- 
level automatic invalidation cache. Based on observed workload characteristics, we 
enhance the cache with the necessary dependency tracking and ... 

13 A fra g ment-based approach for e ffi cient ly creatin g dy namic web con tent ; 
^ Jim Challenger, Paul Dantzig, Arun Iyengar, Karen Witting 

May 2005 ACM Transactions on Internet Technology (TOIT), Volume 5 issue 2 

Publisher: ACM Press 
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Full text available: ^g) pdf(2.33 MB) Additional Information: full citati on, abstract , reference s, citings, index 
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This article presents a publishing system for efficiently creating dynamic Web content. 
Complex Web pages are constructed from simpler fragments. Fragments may recursively 
embed other fragments. Relationships between Web pages and fragments are 
represented by object dependence graphs. We present algorithms for efficiently detecting 
and updating Web pages affected after one or more fragments change. We also present 
algorithms for publishing sets of Web pages consistently; different algorithms are ... 

Keywords: Caching, Web, Web performance, dynamiccontent, fragments, publishing 

14 Session 6: web services II: Model driven distribution pattern desi g n for dynamic web §g 

service compositions 
^ Ronan Barrett, Lucian M. Patcas, Claus Pahl, John Murphy 

July 2006 Proceedings of the 6th international conference on Web engineering ICWE 
06 

Publisher: ACM Press 

Full text available: ^_pdf(238;91 KB) Additional Information: full citation, abstract, references, indexjerms 

Web service compositions are often used to realise service-based enterpriseapplications. 
These enterprise systems are built from many existing discreteapplications, often legacy 
applications exposed using Web service interfaces. Acceptance of these systems is often 
constrained by non-functional aspects,such as Quality of Service (QoS). A number of 
factors affect the QoS of anenterprise system, including availability, scalability and 
performance. Thereare a number of architectural configurations o ... • 

Keywords: compositions, decentralisation, distribution patterns, mda, web services 



15 Session 4: modeling and tools I: Modelin g and generating a pplication lo g ic for data- 
^ intensive web a p plications 

^ Mihaly Jakob, Holger Schwarz, Fabian Kaiser, Bernhard Mitschang 

July 2006 Proceedings of the 6th international conference on Web engineering ICWE 
'06 

Publisher: ACM Press 

Full text available* f£| pdf(_283 71 KB). Additlonal Information: full citation , abstract , references , cited b y. index 

terms 

This paper presents a new approach for the development of data-intensive web 
applications that depend on sophisticated application logic. E-Commerce web sites, on- 
line auction systems and large enterprise web portals fall. into this category as they 
require comprehensive data access, data processing and data manipulation capabilities. 
However, existing methodologies mainly concentrate on modeling content, navigation and 
presentation aspects of read-only web sites. In our opinion these models are ... 

Keywords: application logic modeling, code generation, data-intensive applications, 
object-orientation, web application design 



16 Scalable systems for dynamic content: Globet p: tem plate-based database re plication 
<g> for scalable web a p plications 

^ Tobias Groothuyse, Swaminathan Sivasubramanian, Guillaume Pierre 

May 2007 Proceedings of the 16th international conference on World Wide Web 

WWW '07 
Publisher: ACM Press 

Full text available: l Q pdf(238.13 KB ) Additional Information: full citation , abstract , references , index terms 

Generic database replication algorithms do not scale linearly in throughput as all update, 
deletion and insertion (UDI) queries must be applied to every database replica. The 
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throughput is therefore limited to the point where the number of UDI queries alone is 
sufficient to overload one server. In such scenarios, partial replication of a database can 
help, as UDI queries are executed only by a subset of all servers. In this paper we 
propose GlobeTP, a system that employs partial replication t ... 

Keywords: database replication, partial replication, scalability, web applications 



17 D ynamic content acceleration: a cachin g solution to enable scalable dynamic Web 
^ page g eneration 

Anindya Datta, Kaushik Dutta, Krithi Ramamritham, Helen Thomas, Debra VanderMeer 
May 2001 ACM SIGMOD Record , Proceedings of the 2001 ACM SIGMOD international 

conference on Management of data SIGMOD '01, volume 30 issue 2 
Publisher: ACM Press 

Full text available: c Q.pdf(57,58_KB). Additional Information: full. citation, citings, index terms 



1 8 Engi neerin g server-driven co nsistency for large scale dynamic Web services 
J' an Y ' n / Lorenzo Alvisi, Mike Dahlin, Arun Iyengar 

April 2001 Proceedings of the 10th international conference on World Wide Web 
WWW '01 

Publisher: ACM Press 

Full text available: t jg?) pdf(291.44 KB ) Additional Information: full citation , references , citings, index terms 



Keywords: Web cache consistency, dynamic content, performance, scalability, volume 
lease 



19 Enablin g d ynamic content cachin g for database-driven web sites 

K. Selguk Candan, Wen-Syan Li, Qiong Luo, Wang-Pin Hsiung, Divyakant Agrawal 
May 2001 ACM SIGMOD Record , Proceedings of the 2001 ACM SIGMOD international 

conference on Management of data SIGMOD '01, volume 30 issue 2 
Publisher: ACM Press 

Full text available 1 "pi pdf(31 9 67 KB) Adc, ' tional Information: full cita tion, abstract, references, citings, index 
■ - / terms 

Web performance is a key differentiation among content providers. Snafus and slowdowns 
at major web sites demonstrate the difficulty that companies face trying to scale to a 
large amount of web traffic. One solution to this problem is to store web content at 
server-side and edge-caches for fast delivery to the end users. However, for many e- 
commerce sites, web pages are created dynamically based on the current state of 
business processes, represented in application servers and databases 
Keywords: JDBC, application server, database driven web site, dynamic content caching, 
invalidation, web acceleration 



20 Evaluating the performance of user-space and kernel-space web servers 
Amol Shukla, Lily Li, Anand Subramanian, Paul A. S. Ward, Tim Brecht 

October 2004 Proceedings of the 2004 conference of the Centre for Adva need Studies 

on Collaborative research CASCON '04 
Publisher: IBM Press 

Full text available' e P| pdf(91 70 KB) Additional Information: full citation, abstract , references , index term s. 
• [AJ = review 

There has been much debate over the past few years about the practice of moving 
traditional user-space applications, such as web servers, into the kernel for better 
performance. Recently, the user-space userver web server has shown promising 
performance for delivering static content. In this paper we first describe how we 
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augmented the userver to enable it to serve dynamic content. We then evaluate the 
performance of the userver and the kernel-space TUX web server, using the SPECweb99 
workloa ... 
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1 A scalable and hi ghl y available system for serving d ynamic data at frequently j 
accessed web sites 

Jim Challenger, Paul Dantzig, Arun Iyengar 

November 1998 Proceedings of the 1998 ACM/IEEE conference on Supercom puting 
(CDROM) Supercomputing '98 

Publisher: IEEE Computer Society 

Full text available: *Q pdf ( 195.17 K B) Additional Information: full c i tation , abstract , referen ces, citings 

This paper describes the system and key techniques used for achieving performance and 
high availability at the official Web site for the 1998 Olympic Winter Games which was one 
of the most popular Web sites for the duration of the Olympic Games. The Web site 
utilized thirteen SP2 systems scattered around the globe containing a total of 143 
processors. A key feature of the Web site was that the data being presented to clients was 
constantly changing. Whenever new results were entered into the sys ... 

2 Efficie ntl y servin g d ynamic data at highl y accessed web sites 

James R. Challenger, Paul Dantzig, Arun Iyengar, Mark S. Squillante, Li Zhang 
April 2004 IEEE/ACM Transactions on Networking (TON), volume 12 issue 2 

Publisher: IEEE Press 

Additional Information: full citation, abstract , references , citin gs, index 
terms 



Full text available: g pdf(499.05 KB) 



We present architectures and algorithms for efficiently serving dynamic data at highly 
accessed Web sites together with the results of an analysis motivating our design and 
quantifying its performance benefits. This includes algorithms for keeping cached data 
consistent so that dynamic pages can be cached at the Web server and dynamic content 
can be served at the performance level of static content. We show that our system design 
is able to achieve cache hit ratios close to 100% for cached data ... 

Keywords: caching, dynamic content, performance analysis, prefetching, stochastic 
models, web sites 



Engi neerin g server -driven consistency for large scale dynamic Web services 
Jian Yin, Lorenzo Alvisi, Mike Dahlin, Arun Iyengar 

April 2001 Proceedings of the 10th international conference on World Wide Web 
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4 En g ineering web cache consistenc y §^ 
Jian Yin, Lorenzo Alvisi, Mike Dahlin, Arun Iyengar 

August 2002 ACM Transactions on Internet Technology (TOIT), volume 2 issue 3 
Publisher: ACM Press 

Full text available" W\ pdf(403 96 KB) Ac,d ' t ' onal Information: full citation , abstract , references , citings, index 
' : terms 

Server-driven consistency protocols can reduce read latency and improve data freshness 
for a given network and server overhead, compared to the traditional consistency 
protocols that rely on client polling. Server-driven consistency protocols appear 
particularly attractive for large-scale dynamic Web workloads because dynamically 
generated data can change rapidly and unpredictably. However, there have been few 
reports on engineering server-driven consistency for such workloads. This article repo ... 

Keywords: Cache coherence, cache consistency, dynamic content, lease, scalability, 
volume 



Adaptive push-pull: dissem inating dynamic web data 

Pavan Deolasee, Amol Katkar, Ankur Panchbudhe, Krithi Ramamritham, Prashant Shenoy 
April 2001 Proceedings of the 10th international conference on World Wide Web 

WWW '01 
Publisher: ACM Press 

Full text available: *g| pdfd 52.08 KB ) Additional Information: full citation , references , citings, index terms 



Keywords: World Wide Web, data dissemination, dynamic data, pull, push, resiliency, 
scalability, temporal coherency 



A frag ment -based approach forjeffi cie ntly creating d ynamic web content 
Jim Challenger, Paul Dantzig, Arun Iyengar, Karen Witting 

May 2005 ACM Transactions on Internet Technology (TOIT), volume 5 issue 2 
Publisher: ACM Press 

Full text available* W\ df(2 33 MB) Additional Information: full citation, abstract, references, citings, index 
• ia) : terms 

This article presents a publishing system for efficiently creating dynamic Web content. 
Complex Web pages are constructed from simpler fragments. Fragments may recursively 
embed other fragments. Relationships between Web pages and fragments are 
represented by object dependence graphs. We present algorithms for efficiently detecting 
and updating Web pages affected after one or more fragments change. We also present 
algorithms for publishing sets of Web pages consistently; different algorithms are ... 

Keywords: Caching, Web, Web performance, dynamic content, fragments, publishing 



A client-aware dispatchin g alg orithm for web clusters providin g multi ple services 
^ Emiliano Casalicchio, Michele Colajanni 

April 2001 Proceedings of the 10th international conference on World Wide Web 
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Publisher: ACM Press 
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XML query processin g I: Dynamic XML documents with distri but i on a nd replication 
Serge Abiteboul, Angela Bonifati, Gregory Cobena, Ioana Manolescu, Tova Milo 
June 2003 Proceedings of the 2003 ACM SIGMOD i international conference on 

Management of data SIGMOD '03 
Publisher: ACM Press 

Full text available* "PI pdf(2Q9.Q6 K B) AdcJ itional Information: full citati on, abstract, references , citings, index 

~ terms 

The advent of XML as a universal exchange format, and of Web services as a basis for 
distributed computing, has fostered the apparition of a new class of documents: dynamic 
XML documents. These are XML documents where some data is given explicitly while 
other parts are given only intensionally by means of embedded calls to web services that 
can be called to generate the required information. By the sole presence of Web services, 
dynamic documents already include inherently some form of di ... 

The content and access dynamics of a busy Web site: findin g s and implications 
Venkata N. Padmanabhan, Lili Qiu 

August 2000 ACM SIGCOMM Computer Corrim unication Review , Proceedings of the 
conference on Applications, Technologies, Architectures, and Protocols 
for Computer Communication SIGCOMM '00, volume 30 issue 4 
Publisher: ACM Press 

Full text available: fB pdj(820.58 KB) Additional Information: full citation, abstract, references, citings, index 

" terms 

In this paper, we study the dynamics of the MSNBC news site, one of the busiest Web 
sites in the Internet today. Unlike many other efforts that have analyzed client accesses 
as seen by proxies, we focus on the server end. We analyze the dynamics. of both the 
server content and client accesses made to the server. The former considers the content 
creation and modification process while the latter considers page popularity and locality in 
client accesses. Some of our key results are: (a) files ... 

10 Analysis of lexical si g natures for improvin g information persistence on the World 
Wide Web 

Seung-Taek Park, David M. Pennock, C. Lee Giles, Robert Krovetz 
October 2004 ACM Transactions on Information Systems (TOIS), volume 22 issue 4 
Publisher: ACM Press 

Full text available: ffi pdf(8Q8.10 KB ) Additional information: full citation , abstract, references , citings, index 

terms 

A <i>lexical signature</i> (LS) consisting of several key words from a Web document is 
often sufficient information for finding the document later, even if its URL has changed. 
We conduct a large-scale empirical study of nine methods for generating lexical 
signatures, including Phelps and Wilensky's original proposal (PW), seven of our own 
static variations, and one new dynamic method. We examine their performance on the 
Web over a 10-month period, and on a TREC data set, evaluating t ... 

Keywords: Broken URLs, TREC, World Wide Web, dead links, digital libraries, indexing, 
information retrieval, inverse document frequency, lexical signatures, robust hyperlinks, 
search engines, term frequency 
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Summarizing web pages have recently gained much attention from researchers. Until now 
two main types of approaches have been proposed for this task: content- and context- 
based methods. Both of them assume fixed content and characteristics of web documents 
without considering their dynamic nature. However the volatility of information published 
on the Internet argue for the implementation of more time-aware techniques. This paper 
proposes a new approach towards automatic web page description, whi ... 

Keywords: change detection, web document, web page summarization 
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August 2002 Proceedings of the 28th international conference on Very Large Data 
Bases - Volume 28 VLD B '2002 

Publisher: VLDB Endowment 

Full text available: 'gj pdf( 14.12 MB ) Additional Information: full citati on, abstract , refere nces, indexjenris 

Response time is a key differentiation among electronic commerce (e-commerce) 
applications. For many e-commerce applications, Web pages are created dynamically 
based on the current state of a business stored in database systems. Recently, the topic 
of Web acceleration for database-driven Web applications has drawn a lot of attention in 
both the research community and commercial arena. In this paper, we analyze the factors 
that have impacts on the performance and scalability of Web application ... 

13 On disk cachin g of Web objects in proxy servers 
Charu G. Aggarwal, Philip S. Yu 

January 1997 Proceedings of the sixth international conference on Information and 

knowledge management CIKM '97 
Publisher: ACM Press 

Full text available: E g|pcif( 911.21 K B) Additional Information: full citatio n, ref erences , citings', index te rms 



14 Versioning and fragmentation: Automatic detection of fragments in dynamically 
<gy g enerated web pages 

^ Lakshmish Ramaswamy, Arun Iyengar, Ling Liu, Fred Douglis 

May 2004 Proceedings of the 13th international conference on World Wide Web 

WWW '04 
Publisher: ACM Press 

Full text available* pdf( 268 12 KB) Adc,jtional Information: full citation , abstract , references , citings, index 
' ^ ' terms 

Dividing web pages into fragments has been shown to provide significant benefits for both 
content generation and caching. In order for a web site to use fragment-based content 
generation, however, good methods are needed for dividing web pages into fragments. 
Manual fragmentation of web pages is expensive, error prone, and unscalable. This paper 
proposes a novel scheme to automatically detect and flag fragments that are cost- 
effective cache units in web sites serving dynamic content. We consider ... 

Keywords: L-P fragments, dynamic content caching, fragment detection, fragment-based 
caching, shared fragments 



15 Model-driven simulation o f World-Wide-W eb ca che policies 
Ying Shi, Edward Watson, Ye-sho Chen 

December 1997 Proceedings of the 29th conference on Winter simulation WSC '97 
Publisher: ACM Press, IEEE Computer Society 

Full text available: 'g) pdf(77 3.92 KB) Additional Information: fuN citation, references, citings, indexjerms 
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1 6 Applications: Efficient and transparent dyn amic co ntent updates for mobile clients 
Trevor Armstrong, Olivier Trescases, Cristiana Amza, Eyal de Lara 
June 2006 Proceedings of the 4th international conference on Mobile systems, 

applications and services MobiSys 2006 
Publisher: ACM Press 

Full text available: Q pdf (378.99 KB ) Additional Information: full citation , abstract , references , index terms 

We introduce a novel infrastructure supporting automatic updates for dynamic content 
browsing on resource constrained mobile devices. Currently, the client is forced to 
continuously poll for updates from potentially different data sources, such as, e- 
commerce, on-line auctions, stock and weather sites, to stay up to date with potential 
changes in content. We employ a pair of proxies, located on the mobile client and on a 
fully-connected edge server, respectively, to minimize the battery consumpt ... 

Keywords: batching, caching, energy measurement, mobile wireless communication, 
power management, pre-fetching, proxy 

17 Con sistency and replication: Evaluation of edge caching/offloading for dynamic Q 

<|k content deliv ery 
^ Chun Yuan, Yu Chen, Zheng Zhang 

May 2003 Proceedings of the 12th international conference on World Wide Web 

WWW '03 
Publisher: ACM Press 

Full text available- Hi pdf(161 .49 KB) Additional Information: full citation , abstract, references, citings, index 
^ terms 

As dynamic content becomes increasingly dominant, it becomes an important research 
topic as how the edge resources such as client-side proxies, which are otherwise 
underutilized for such content, can be put into use. However, it is unclear what will be the 
best strategy and the design/deployment tradeoffs lie therein. In this paper, using one 
representative e-commerce benchmark, we report our experience of an extensive 
investigation of different offloading and caching options. Our results point ... 

Keywords: dynamic content, edge caching, offloading 

18 Server performance and scalability : Challenges and practices in de ploying web Q 
acceleration solutions for dis tributed enterpri se systems 

Wen-Syan Li, Wang-Pin Hsiung, Oliver Po, Koji Hino, Kasim Selcuk Candan, Divyakant 
Agrawal 

May 2004 Proceedings of the 13th international conference on World Wide Web 
WWW '04 

Publisher: ACM Press 

Full text available: Qpdf(6. 61 MB) Additional information: fuU citation, abstract, references, indexjerms 

For most Web-based applications, contents are created dynamically based on the current 
state of a business, such as product prices and inventory, stored in database systems. 
These applications demand personalized content and track user behavior while 
maintaining application integrity. Many of such practices are not compatible with Web 
acceleration solutions. Consequently, although many web acceleration solutions have 
shown promising performance improvement and scalability, architecting and engin ... 

Keywords: application server, dynamic content, edge server, fragment, j2ee, reliability, 
scalability, web acceleration 
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^ May 2003 Proceedings of the 12th international conference on World Wide Web 
WWW '03 

Publisher: ACM Press 

Full text available- f?| pdf(151 32 KB) Adc, ' tional Information: full citation , abstra ct, references , citings, index 
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Continuous queries are queries for which responses given to users must be continuously 
updated, as the sources of interest get updated. Such queries occur, for instance, during 
on-line decision making, e.g., traffic flow control, weather monitoring, etc. The problem of 
keeping the responses current reduces to the problem of deciding how often to visit a 
source to determine if and how it has been modified, in order to update earlier responses 
accordingly. On the surface, this seems to be similar ... 

Keywords: allocation policies, continuous queries 



20 Ap plications: From HTTP to HTML: Erlan g /OTP experiences in web based service |pj 
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^ Francesco Cesarinj, Lukas Larsson, Michal Slaski 

September 2006 Proceedings of the 2006 ACM SIGPLAN workshop on Erlang ERLANG 
'06 

Publisher: ACM Press 

Full text available: pdf(536.25 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes the lessons learnt when internally developing web applications in 
Erlang. On the basis of these experiences, a framework called the Web Platform has been 
implemented. The Web Platform follows a design pattern separating data processing and 
formatting, allowing the construction of flexible and maintainable software architectures. 
It also delivers mechanisms for building dynamic pages and components. On top of the 
platform and components, web interfaces to commercial Erlang sy ... 

Keywords: HTML, HTTP, erlang, web frameworks 
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1 Industrial and practical experience track pa per session 1: The volume and evolution 
^ of web pag e templates 

David Gibson, Kunal Punera, Andrew Tomkins 

May 2005 Special interest tracks and posters of the 14th international conference on 
World Wide Web WWW '05 

Publisher: ACM Press 

Full text available' *Wi pdf(249 32 KB) Additional Information: full citation, abstract, references, citings, index 
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Web pages contain a combination of unique content and template material, which is 
present across multiple pages and used primarily for formatting, navigation, and 
branding. We study the nature, evolution, and prevalence of these templates on the web. 
As part of this work, we develop new randomized algorithms for template extraction that 
perform approximately twenty times faster than existing approaches with similar quality. 
Our results show that 40—50% of the content on the web is templa ... 

Keywords: algorithms, boilerplate, data cleaning, data mining, templates, web mining 



2 Detection and evidence: A fast and robust method for web pag e template detection 
and removal 

Karane Vieira, Altigran S. da Silva, Nick Pinto, Edleno S. de Moura, Joao M. B. Cavalcanti, 
Juliana Freire 

November 2006 Proceedings of the 15th ACM international conference on Information 
and knowledge management CIKM '06 

Publisher: ACM Press 

Full text available: pdf( 316.2 0 KB) Additional Information: full citation , abstra ct, references, i ndex term s 

The widespread use of templates on the Web is considered harmful for two main reasons. 
Not only do they compromise the relevance judgment of many web IR and web mining 
methods such as clustering and classification, but they also negatively impact the 
performance and resource usage of tools that process web pages. In this paper we 
present a new method that efficiently and accurately removes templates found in 
collections of web pages. Our method works in two steps. First, the costly process of te ... 



Keywords: web page noise removal, web template extraction 
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November 2004 Proceedings of the 6th annual ACM international workshop on Web 

information and data management WIDM '04 
Publisher: ACM Press 

Full text available: "g) pdf(563.56 KB) Additional Information: full citation , abstract , references , index terms 

Hidden Web databases maintain a collection of specialised documents, which are 
dynamically generated in response to users' queries. However, the documents are 
generated by Web page templates, which contain information that is irrelevant to queries. 
This paper presents a Two-Phase Sampling (2PS) technique that detects templates and 
extracts query-related information from the sampled documents of a database. In the first 
phase, 2PS queries databases with terms contained in their search interfac ... 

Keywords: document sampling, hidden web databases, information extraction 



4 Tree-Stru ctured Template Generation for Web J^ages | 
Shui-Lung Chuang, jane Yung-jen Hsu 

September 2004 Proceedings of the 2004 IEEE/WIC/ACM International Conference on 
Web Intelligence WI '04 

Publisher: IEEE Computer Society 

Full text available: fg)pdf(1 81 .46 KB) 
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As the web becomes an increasingly important source of information, tools for modeling, 
searching, and extracting information from Web pages are indispensable. By modeling the 
structure of a Web page defined by its markup tags, one can easily extract target 
information using structural templates. This paper introduces the Tree Template 
Automatic Generator (TTAG) that learns tree-structured templates from training Web 
pages. TTAG was applied to both query-based and frequently updated Web sites, a ... 

5 Research track pa pers: Minin g te mplates from search result records of search 
<|> engines 

" Hongkun Zhao, Weiyi Meng, Clement Yu 

August 2007 Proceedings of the 13th ACM SIGKDD international conference on 

Knowledge discovery and data mining KDD f 07 
Publisher: ACM Press 

Full text available: pdf(97 2.30 KB) Additional Information: full citation, abstract, references, indexjejms 

Metasearch engine, Comparison-shopping and Deep Web crawling applications need to 
extract search result records enwrapped in result pages returned from search engines in 
response to user queries. The search result records from a given search engine are 
usually formatted based on a template. Precisely identifying this template can greatly help 
extract and annotate the data units within each record correctly. In this paper, we 
propose a graph model to represent record template and develop a dom ... 

Keywords: information extraction, search engine, wrapper generation 
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Lan Yi, Bing Liu, Xiaoli Li 

August 2003 Proceedings of the ninth ACM SIGKDD international conference on 

Knowledge discovery and data mining KDD '03 
Publisher: ACM Press 
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A commercial Web page typically contains many information blocks. Apart from the main 
content blocks, it usually has such blocks as navigation panels, copyright and privacy 
notices, and advertisements (for business purposes and for easy user access). We call 
these blocks that are not the main content blocks of the page the noisy blocks. We show 
that the information contained in these noisy blocks can seriously harm Web data mining. 
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7 Poster pap ers - short pa pers: Extracting u nstructured data from template g ene rated 
web documents 

Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Chung 

November 2003 Proceedings of the twelfth international conference on Information 

and knowledge management CIKM '03 
Publisher: ACM Press 

Full text available- "PI odf(210 48 KB) Additiona ' Information: full citation , abstract, references , citin gs, index 
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We propose a novel approach that identifies web page templates and extracts the 
unstructured data. Extracting only the body of the page and eliminating the template 
increases the retrieval precision for the queries that generate irrelevant results. We 
believe that by reducing the number of irrelevant results; the users are encouraged to go 
back to a given site to search, Gur experimental results on several different web sites and 
on the whole cnnfn collection demonstrate the feasibility of our a ... 

Keywords: automatic template removal, information retrieval, retrieval accuracy, text 
extraction 



8 Information access and retrieval (IAR): Template detection for Jaigescaje search 
<§> engines 

^ Liang, Chen, Shaozhi Ye,. Xing Li 

April 2006 Proceedings of the 2006 ACM symposium on Applied computing SAC '06 
Publisher: ACM Press 

Full text available' *W\ df(416.8t KB) Addit ' onal Information: full citati on, abstract , references , cited b y. index 
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Templates in web sites hurt search engine retrieval performance, especially in content 
relevance and link analysis. Current template removal methods suffer from processing 
speed and scalability when dealing with large volume web pages. In this paper, we 
propose a novel two-stage template detection method, which combines template 
detection and removal with the index building process of a search engine. First, web 
pages are segmented into blocks and blocks are. clustered according to. their style fe ... 

Keywords: clustering, template detection, web page segmentation 



9 Novel web a p plications: The portra it of a common HTML web page 
Ryan Levering, Michal Cutler 

October 2006 Proceedings of the 2006 ACM symposium on Document engineering 
DocEng '06 

Publisher: ACM Press 

Full text available: f^l pdf( 270.72 KB) Additional Information: fyicitatLo_n, abstract, references, indexjerms 



Web pages are not purely text, nor are they solely HTML. This paper surveys HTML web 
pages; not only on textual content, but with an emphasis on higher order visual features 
and supplementary technology. Using a crawler with an in-house developed rendering 
engine, data on a pseudo-random sample of web pages is collected. First, several basic 
attributes are collected to verify the collection process and confirm certain assumptions on 
web page text. Next, we take a look at the distribution of diff ... 

Keywords: CSS, HTML, feature, javascript, script, style, survey, visual, world wide web 
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Measurin g and characterizin g end-to-end Internet service performance 
Ludmila Cherkasova, Yun Fu, Wenting Tang, Amin Vahdat 

November 2003 ACM Transactions on Internet Technology (TOIT), Volume 3 issue 4 
Publisher: ACM Press 

Full text available* fiD pdf(1 46 MB) Additional Information: full citation , abstract , references , citings, index 
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Fundamental to the design of reliable, high-performance network services is an 
understanding of the performance characteristics of the service as perceived by the client 
population as a whole. Understanding and measuring such end-to-end service 
performance is a challenging task. Current techniques include periodic sampling of service 
characteristics from strategic locations in the network and instrumenting Web pages with 
code that reports client-perceived latency back to a performance server. Li ... 

Keywords: End-to-end service performance, QoS, network packet traces, passive 
monitoring, reconstruction of web page composition, web site performance 



11 Pa per session IR-3 (information retriev al ): web retrieval: Person resolution in person 
search results: WebHawk 
Xiaojun Wan, Jianfeng Gao, Mu Li, Binggong Ding 

October 2005 Proceedings of the 14th ACM international conference on Information 

and knowledge management CIKM '05 
Publisher: ACM Press 

Full text available: t Q pdf(616.03 KB) Additional Information: full citation , abstract , references , index terms 

Finding information about people on the Web using a search engine is difficult because 
there is a many-to-many mapping between person names and specific persons (i.e. 
referents). This paper describes a person resolution system, called WebHawk. Given a 
list of pages obtained by submitting a person query to a search engine, WebHawk 
facilitates person search in three steps: First of all, a filter removes those pages that 
contain no information about any person. Secondly, a c ... 

Keywords: clustering, junk filtering, person resolution, person search 
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Le Phong Bao Vuong, Xiaoying Gao, Mengjie Zhang 

December 2006 Proceedings of the 2006 IEEE/WIC/ACM International Conference on 
Web Intelligence WI 06 

Publisher: IEEE Computer Society 

Full text available: l g| pdf (149.77 KB ) Additional Information: full citation , abstract, index terms 

This paper introduces an approach to the use of clustering for data extraction from semi- 
structured Web pages. A variant Hierarchical Agglomerative Clustering (HAC) algorithm K- 
neighbours-HAC is developed which uses the similarities of the data format (HTML tags) 
and the data content (text string values) to group similar text tokens into clusters. Using 
these clusters, similar text tokens are identi- fied as data fields and extracted as target 
information. The approach is examined and compared w ... 

13 Learning classifiers: Learnin g block importance models for web pages 
A Ruihua Song, Haifeng Liu, Ji-Rong Wen, Wei-Ying Ma 

V May 2004 Proceedings of the 13th international conference on World Wide Web 
WWW '04 

Publisher: ACM Press 

Full text available" fll pdf(1 23 MB) Additional Information: fuJLcMion, abstract, references, citings, index 
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Previous work shows that a web page can be partitioned into multiple segments or blocks, 
and often the importance of those blocks in a page is not equivalent. Also, it has been 
proven that differentiating noisy or unimportant blocks from pages can facilitate web 
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mining, search and accessibility. However, no uniform approach and model has been 
presented to measure the importance of different segments in web pages. Through a user 
study, we found that people do have a consistent view about the impo ... 

Keywords: block importance model, classification, page segmentation, web mining 
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March 2000 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: t g[ pdf(281.14 KB) Additional Information: full citation , abstract, citings, index terms 

The analysis of web usage has mostly focused on sites composed of conventional static 
pages. However, huge amounts of information available in the web come from databases 
or other data collections and are presented to the users in the form of dynamically 
generated pages. The query interfaces of such sites allow the specification of many search 
criteria. Their generated results support navigation to pages of results combining cross- 
linked data from many sources. For the analysis of visitor naviga ... 

Keywords: Conceptual hierarchies, Data mining, Query capabilities, Web databases, Web 
query interfaces, Web usage mining 
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Publisher: ACM Press 

Full text available: ^ pdf(601 .78 KB) Additional Information: full citati on, abstract , references 

' Previous work shows that a web page can be partitioned into multiple segments or blocks, 
and often the importance of those blocks in a page is not equivalent. It has also been 
proven that differentiating noisy and unimportant blocks from pages can facilitate web 
mining, search and accessibility. However, no uniform approach and model has been 
presented to measure the importance of different blocks in a web page. Through a user 
study, we found that people do have a consistent view about the impor ... 

Keywords: block importance model, classification, page segmentation, web mining 
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Fabio Gasparetti, Alessandro Micarelli 

January 2007 Proceedings of the 12th international conference on Intelligent user 
interfaces IUI '07 

Publisher: ACM Press 

Full text available:^ pdf(57Q. 20 KB) Additional Information: full citation , abstract , references , index terms 

Browsing activities are an important source of information to build profiles of the user 
interests and personalize the human-computer interaction during information seeking 
tasks. Visited pages are easily collectible, e.g., from browsers' histories and toolbars, or 
desktop search tools, and they often contain documents related to the current user needs. 
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